Fine tuning LLM for text classification -- error with SFTTrainer | 2 | 1361 | June 3, 2025
Crisp AI to AI language the road to AGI | 1 | 15 | May 29, 2025
Why do custom development? | 4 | 42 | May 28, 2025
Simplifying Hugging Face Spaces API calls in Flutter using hugging_face_chat_gradio package | 1 | 16 | May 26, 2025
Trouble fine-tuning Flan-T5 (with LoRA) for structured map generation – model repeats prompt or instructions | 1 | 13 | May 26, 2025
Why is the memory quickly filled up in the first few iterations when using Trainer of transformers to train the network, and then drops to a very low level as the training progresses? | 0 | 11 | May 25, 2025
Dario Schiraldi : How can I set up a commercially viable workflow in ComfyUI to perform accurate face-swapping? | 0 | 25 | May 22, 2025
How to forbade Gemma 2 from using a certain phrase and use another one in its place? | 7 | 16 | May 21, 2025
Dedicated endpoint getting 429 errors | 4 | 69 | May 21, 2025
429 for Kokoro-82M model | 1 | 31 | May 19, 2025
GradioUI + Smolagents + MCP "Event loop is closed" | 1 | 39 | May 16, 2025
Program not working on GPU but works on CPU | 22 | 146 | May 16, 2025
🚀 New tool for AI manga creators: **MangaBuilder** (buildmanga.com) | 2 | 26 | May 16, 2025
Handling Extreme Class Imbalance for Multi-Class Classification | 1 | 27 | May 14, 2025
Matching Single Shoes with Computer Vision – Alternatives to Cosine Similarity and Siamese Networks need advice | 3 | 13 | May 12, 2025
Resize embeddings on Peft model | 4 | 446 | May 12, 2025
Blip2 peft training | 2 | 188 | May 9, 2025
How to setup JSON based workflow/flowchart generation based on user prompt? | 1 | 33 | May 9, 2025
Cuda OOM on 4 A6000s (142 GB of VRAM) even after using Zero3, Qlora, Accelerate, Max_token_length | 1 | 46 | May 8, 2025
How do i batch in streaming of data set | 1 | 38 | May 3, 2025
Help with Quantizing phi-4 MM Fine-Tuned Vision Model and Converting to ONNX | 3 | 52 | May 2, 2025
Checking if two column have the language i want | 1 | 26 | May 1, 2025
Strange pyarrow error when extracting rows from a public dataset | 2 | 28 | April 30, 2025
A Poem that help LLM improve quality & reduce 50% overhead | 0 | 22 | April 29, 2025
Gradio Chatbox - no api found | 1 | 46 | April 29, 2025
Sudden Loss Drop and Poor Performance During Model Training | 0 | 33 | April 28, 2025
🔧 Optimizing Phi-4 MM Instruct Vision Model (ONNX Inference) | 1 | 41 | April 24, 2025
Can anybody recommend a good image filename generating AI? | 1 | 23 | April 24, 2025
Arabic to French Word embedding Using skip-gram needs new Ideas in the data part | 0 | 31 | April 23, 2025
Cache Proxy - Like with Docker Registries | 1 | 417 | October 21, 2024